Efficient Learning of Goal-Oriented Push-Grasping Synergy in Clutter

Abstract

We focus on the task of goal-oriented grasping, in which a robot is supposed to grasp a pre-assigned goal object in clutter and needs some pre-grasp actions, such as pushes, to enable stable grasps. However, in this task, the robot gets positive rewards from the environment only when it successfully grasps the goal object. Besides, joint pushing and grasping elongates the action sequence, compounding the problem of reward delay. Thus, sample inefficiency remains a main challenge in this task. In this letter, a goal-conditioned hierarchical reinforcement learning formulation with high sample efficiency is proposed to learn a push-grasping policy for grasping a specific object in clutter. In our work, sample efficiency is improved by two means. First, we use a goal-conditioned mechanism with goal relabeling to enrich the replay buffer. Second, the pushing and grasping policies are respectively regarded as a generator and a discriminator, and the pushing policy is trained with the supervision of the grasping discriminator, thus densifying pushing rewards. To deal with the distribution mismatch caused by the different training settings of the two policies, an alternating training stage is added to learn pushing and grasping in turn. A series of experiments carried out in simulation and the real world indicate that our method can quickly learn effective pushing and grasping policies and outperforms existing methods in task completion rate and goal grasp success rate with fewer motions. Furthermore, we validate that our system can also adapt to goal-agnostic conditions with better performance. Note that our system can be transferred to the real world without any fine-tuning. Our code is available at https://github.com/xukechun/Efficient_goal-oriented_push-grasping_synergy
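The goal relabeling mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the Transition fields, the sparse_reward, relabel, and store functions, and the rule "relabel when a non-goal object was grasped" are all assumptions made for illustration, in the spirit of hindsight-style relabeling.

```python
from dataclasses import dataclass, replace
from typing import List, Optional


@dataclass(frozen=True)
class Transition:
    """One replay-buffer entry (hypothetical layout, for illustration only)."""
    state: str              # placeholder for an observation
    action: str             # placeholder for a push or grasp action
    goal: int               # id of the object the policy was asked to grasp
    grasped: Optional[int]  # id of the object actually grasped, or None
    reward: float


def sparse_reward(goal: int, grasped: Optional[int]) -> float:
    """Positive reward only when the goal object itself was grasped."""
    return 1.0 if grasped is not None and grasped == goal else 0.0


def relabel(t: Transition) -> Optional[Transition]:
    """Return a copy whose goal is the object that was actually grasped.

    Assumed rule: only transitions that grasped a non-goal object are
    relabeled; failed grasps and already-successful grasps are left alone.
    """
    if t.grasped is None or t.grasped == t.goal:
        return None
    return replace(t, goal=t.grasped, reward=sparse_reward(t.grasped, t.grasped))


def store(buffer: List[Transition], t: Transition) -> None:
    """Append the original transition and, when available, its relabeled copy."""
    buffer.append(t)
    relabeled = relabel(t)
    if relabeled is not None:
        buffer.append(relabeled)


buffer: List[Transition] = []
# The policy was asked for object 1 but grasped object 2: zero reward as recorded.
store(buffer, Transition("s0", "grasp", goal=1, grasped=2, reward=sparse_reward(1, 2)))
# A grasp that picked up nothing cannot be relabeled.
store(buffer, Transition("s1", "grasp", goal=1, grasped=None, reward=0.0))
```

Under these assumptions, the first call stores two entries (the original failure and a relabeled success for object 2), so the buffer gains a positive-reward sample from an episode that would otherwise contribute none.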

Related resources

A Framework for Push-Grasping in Clutter

Humans use a remarkable set of strategies to manipulate objects in clutter. We pick up, push, slide, and sweep with our hands and arms to rearrange clutter surrounding our primary task. But our robots treat the world like the Tower of Hanoi — moving with pick-and-place actions and fearful to interact with it with anything but rigid grasps. This produces inefficient plans and is often inapplicab...

Using clinical information in goal-oriented learning.

We have proposed an extension to the Q-learning algorithm that incorporates the existing clinical expertise into the trial-and-error process of acquiring an appropriate administration strategy of rHuEPO to patients with anemia due to ESRD. The specific modification lies in multiple updates of the Q-values for several dose/response combinations during a single learning event. This in turn decrea...

Personalized e-Learning – a Goal Oriented Approach

A major drawback of current e-learning systems is that they are too disconnected from learners' learning preferences and learning goals. There has been high demand for learner-centric e-learning systems. Research on personalized e-learning has emerged in recent years. However, most current research focuses on user profile modeling, learning styles, etc. In this paper...

Learning End-to-End Goal-Oriented Dialog

End-to-end dialog systems, in which all components are learnt simultaneously, have recently obtained encouraging successes. However, these were mostly on conversations related to chit-chat with no clear objective and for which evaluation is difficult. This paper proposes a set of tasks to test the capabilities of such systems on goal-oriented dialogs, where goal completion ensures a well-defined...

A Goal Oriented e-Learning Agent System

This paper illustrates a goal-oriented approach to modeling an agent-mediated e-learning system. Most traditional e-learning systems are not learner-centric, and they often ignore the diversity of the learner population; thus, very often their service is not able to directly or effectively match the learner's goal. From the learner's point of view, we propose a goal-oriented approach to develop an ...

Journal

Journal title: IEEE Robotics and Automation Letters

Year: 2021

ISSN: 2377-3766

DOI: https://doi.org/10.1109/lra.2021.3092640